Protein fold recognition from secondary structure assignments
نویسندگان
چکیده
A method for finding protein folds consistent with secondary structure assignments and imposed experimental restraints is described. All possible matches between the query pattern and every member of a database of protein structural domains are generated by a comparison of secondary structure assignments. The comparison allows for errors in predicted secondary structure elements and possible variiztions between query and database structure. Several filters remove matches that are ma-compact, that have poor p sheet bonding, that do not allow loop/turn lengths to bridge the distance between connected secondary structures, or that fail to satisfy imposed experimental restraints (e.g. disulphide bonds). The remaining matches provide a set of plausible topologies for a protein, of unknown structure, which can be inspected visually or tested by experiment. A search using the SIC homology 2 domain prediction finds 19 possible topologies, one being a domain from the E. coli bio protein known to adopt an SH2 fold. The use and development of the method are discussed.
منابع مشابه
Protein fold recognition by mapping predicted secondary structures.
A strategy is presented for protein fold recognition from secondary structure assignments (alpha-helix and beta-strand). The method can detect similarities between protein folds in the absence of sequence similarity. Secondary structure mapping first identifies all possible matches (maps) between a query string of secondary structures and the secondary structures of protein domains of known thr...
متن کاملUsing Phylogeny to Improve Genome-Wide Distant Homology Recognition
The gap between the number of known protein sequences and structures continues to widen, particularly as a result of sequencing projects for entire genomes. Recently there have been many attempts to generate structural assignments to all genes on sets of completed genomes using fold-recognition methods. We developed a method that detects false positives made by these genome-wide structural assi...
متن کاملRepresentative Protein Sequence and Structure Database
The database provides the information about the non-redundant protein dataset (1573 proteins) obtained from the Protein Data Bank. The information includes PDB ID, Length of the protein, Resolution, PDB Secondary structure, PDB secondary structure summary, PHD secondary structure prediction, PHD secondary structure prediction summary, sequence. We further revised the PDB Secondary structure sum...
متن کاملSecondary structure and 1H, 13C, 15N resonance assignments of the endosomal sorting protein sorting nexin 3
Sorting nexin 3 (SNX3) belongs to a sub-family of sorting nexins that primarily contain a single Phox homology domain capable of binding phosphoinositides and membranes. We report the complete (1)H, (13)C and (15)N resonance assignments of the full-length human SNX3 protein and identification of its secondary structure elements, revealing a canonical fold and unstructured termini.
متن کاملKnowledge-based protein secondary structure assignment.
We have developed an automatic algorithm STRIDE for protein secondary structure assignment from atomic coordinates based on the combined use of hydrogen bond energy and statistically derived backbone torsional angle information. Parameters of the pattern recognition procedure were optimized using designations provided by the crystallographers as a standard-of-truth. Comparison to the currently ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1995